Very Sparse Stable Random Projections, Estimators and Tail Bounds for Stable Random Projections

نویسنده

  • Ping Li
چکیده

The method of stable random projections [39, 41] is popular for data streaming computations, data mining, and machine learning. For example, in data streaming, stable random projections offer a unified, efficient, and elegant methodology for approximating the lα norm of a single data stream, or the lα distance between a pair of streams, for any 0 < α ≤ 2. [18] and [20] applied stable random projections for approximating the Hamming norm and the max-dominance norm, respectively, using very small α. Another application is to approximate all pairwise lα distances in a data matrix to speed up clustering, classification, or kernel computations. Given that stable random projections have been successful in various applications, this paper will focus on three different aspects in improving the current practice of stable random projections.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimators and tail bounds for dimension reduction in lα (0 < α ≤ 2) using stable random projections

Abstract The method of stable random projections is popular in data stream computations, data mining, information retrieval, and machine learning, for efficiently computing the lα (0 < α ≤ 2) distances using a small (memory) space, in one pass of the data. We propose algorithms based on (1) the geometric mean estimator, for all 0 < α ≤ 2, and (2) the harmonic mean estimator, only for small α (e...

متن کامل

Sparse Recovery with Very Sparse Compressed Counting

Compressed1 sensing (sparse signal recovery) often encounters nonnegative data (e.g., images). Recently [11] developed the methodology of using (dense) Compressed Counting for recovering nonnegative Ksparse signals. In this paper, we adopt very sparse Compressed Counting for nonnegative signal recovery. Our design matrix is sampled from a maximally-skewed α-stable distribution (0 < α < 1), and ...

متن کامل

Binary and Multi-Bit Coding for Stable Random Projections

We develop efficient binary (i.e., 1-bit) and multi-bit coding schemes for estimating the scale parameter of α-stable distributions. The work is motivated by the recent work on one scan 1-bit compressed sensing (sparse signal recovery) [12] using α-stable random projections, which requires estimating of the scale parameter at bits-level. Our technique can be naturally applied to data stream com...

متن کامل

Nonlinear Estimators and Tail Bounds for Dimension Reduction in l 1 Using Cauchy Random Projections

For 1 dimension reduction in l1, the method of Cauchy random projections multiplies the original data matrix A ∈ R with a random matrix R ∈ R (k ≪ min(n,D)) whose entries are i.i.d. samples of the standard Cauchy C(0, 1). Because of the impossibility results, one can not hope to recover the pairwise l1 distances in A from B = AR ∈ R, using linear estimators without incurring large errors. Howev...

متن کامل

RANDOM PROJECTIONS Margin-constrained Random Projections And Very Sparse Random Projections

Abstract We1 propose methods for improving both the accuracy and efficiency of random projections, the popular dimension reduction technique in machine learning and data mining, particularly useful for estimating pairwise distances. Let A ∈ Rn×D be our n points in D dimensions. This method multiplies A by a random matrix R ∈ RD×k, reducing the D dimensions down to just k . R typically consists ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/cs/0611114  شماره 

صفحات  -

تاریخ انتشار 2006